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Multi-Channel Processing Method 
Field of the inventi n 

The invention relates to a format of audio signals according to claim 1 and a method 
of processing audio signals according to claim 9. 

5 

Moreover, the invention relates to a method of processing audio signals according to 
claim 12, a method of processing audio signals according to claim 13, a method of 
representing an audio signal (AS), said method according to claim 14, a 
1 0 method of decoding a number (M) of directional components () into a, preferably 
lower, nimiber (N) of directional components according to claim 17, a rendering 
system comprising at least one input for receiving a number (M) of directional com- 
ponents (DC) according to claim 18 and a multi-channel data carrier according to 
claim 23 

15 

Background of the invention 

Audio processing and audio rendering are well-known within the art. 

An audio rendering system may typically imply a sound generating unit, e.g. a CD or 
2 0 DVD player and an associated amplifier and loudspeaker system. 

A problem of the rendering known within the art is that the systems are inflexible 
with respect to a possible desired changing of the rendering method. Thus, a change 
from e.g. a two channel stereo rendering into a five channel cinema rendering will 

2 5 typically infer serious technical problems, and the quality of an obtainable rendering 

may be questioned. 

However the number of outputs and the preferred final speaker arrangement most 
often determines how the input signals are being processed, as it is imperative that 

3 0 the sound is modified according to that certain speaker arrangement for the sound 

engineer to give the listener the desired experience. 



CONFIRMATION COPY 
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Consequently sound reproduction is very constrained by the fact that it is necessary 
almost from the beginning to the end of the process to consider which sound format 
and speaker arrangement will be used eventually. And thereby lots of opportunities 
and properties of the soimd are lost almost from the start. 

5 

One of the consequences is that true reverberation or room simulation is almost im- 
possible to achieve by prior art audio formats. 

Another aspect is, that having more sound reproduction formats, including stereo- 
10 phonic formats, surround sound formats and future multi-channel formats, it is nec- 
essary to record and process the sound differently according to each reproduction 
format. 

Preferably there would be some method to record, modify and mix the input signals 
15 without having to consider which reproduction format to use. Then all the heavy and 
significant work on the input signals could be done once, no matter what equipment 
the eventual listener intends to use. 

It is one object of the invention to provide a method for multi-channel sound proc- 
2 0 essing where it is unnecessary to possess knowledge of the final sound reproduction 
format. 

Summary of the invention 

2 5 The invention relates to an audio signal format comprising 

N components (dl, d2, d3, dN) according to claim 1, 

each of the said components (dl, d2, d3, ... dN) representing a direction, said com- 

3 0 ponents preferably being uncorrelated. 



wo 01/11602 



PCT/DKOO/00443 



3 

A component may comprise an accumulation of zero or more signals in a direction 
defined by said component. 

According to a preferred embodiment of the invention, the components should pref- 
5 erably be uncorrelated. At least to the extent that no panning or dependency is es- 
tablished between the signals contained in the format. 

It should be noted, that the signal format according to the invention may be supple- 
mented by further signal components or signal representations within the scope of 
10 the invention. 

When, as stated in claim 2, N components (dl , d2, d3, . . . dN), where N is at least 3, 
a further advantageous embodiment of the invention has been obtained. 

15 When, as stated in claim 3, N is at least 10, preferably at least 20, a further advanta- 
geous embodiment of the invention has been obtained. 

According to the invention, experiments have shown that impressive rendering 
methods may relatively easy be applied to audio signals represented in the format 
2 0 according to the invention when increasing the number of signal components. 

Accordingly, experiments have shown that a ten-directional format (N = 10) may be 
rendered into an impressing stereo image, e.g. by means of a simple gain matrix 
mapping the direction components into the channels of a stereo rendering system. 
2 5 Moreover, an increase of the number of directional components have proven to be 
advantageous when dealing with multi-channel rendering, such as five channel ren- 
dering. 



30 



When, as stated in claim 4,the said directions are three-dimensional directions, a 
further advantageous embodiment of the invention has been obtained. 
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When establishing a three dimensional format, the possibility of establishing a three 
dimensional audio image has been facilitated. 

Evidently, the three dimensional signal format may include both true signal compo- 
5 nents or even "trick"-components. 

When, as stated in claim 5, some or all of said directions are angled in relation to a 
common reference plane and where preferably all of said directions to one and the 
same side of the plane have been placed with approximately the same angle in rela- 
1 0 tion to the common reference plane, a further advantageous embodiment of the in- 
vention has been obtained* 

When, as stated in claim 6, the directions are placed on both sides of the common 
reference plane, where some or all of said directions are angled in relation to the 
1 5 common reference plane and where preferably all of said directions to one and the 
same side of the plane have been placed with approximately the same angle in rela- 
tion to the common reference plane, a further advantageous embodiment of the in- 
vention has been obtained. 

2 0 When, as stated in claim 7, the angle of the directions on one side of the conraion 

reference plane and the angle of the directions on the other side of said plane are sub- 
stantially equal, a further advantageous embodiment of the invention has been ob- 
tained. 

2 5 When, as stated in claim 8, the said directions are distributed among all possible di- 
rections, a further advantageous embodiment of the invention has been obtained. 

According to the invention, an experience-based allocation of directions of compo- 
nents may be applied- 

30 

When, as stated in claim 9, the said directions are distributed with a larger proportion 
of directions in areas with a relatively high density of sound signals than in areas 
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with a relatively low proportion of sound signals, a further advantageous embodi- 
ment of the invention has been obtained. 

When, as stated in claim 10, the said directions are distributed with a larger propor- 
5 tion of directions in areas in which the human perception of sound signals is rela- 
tively sharp, e.g. in front of the head, a further advantageous embodiment of the in- 
vention has been obtained.. 

When, as stated in claim 1 1, said signal is decomposed to a signal comprising N di- 
1 0 rectional components and according to an audio signal format as characterized in one 
or more of claims 1 - 10, a further advantageous embodiment of the invention has 
been obtained. 

Moreover, the invention relates to a method of processing audio signals according to 
15 claim 12, 

said signals comprising 

M sub-signals (si, s2, s3, ... sM), 

20 

each of the said sub-signals (si, s2, s3, ... sM) comprising 
N components (dl, d2, d3, ... dN), 

2 5 each of the said components (dl, d2, d3, ... dN) representing a direction; 

where the said sub-signals (si, s2, s3, ... sM) are added to form a sum-signal (2s) 
comprising 

3 0 N components (Edl , 2d2, 2d3, . . . SdN), 

each of the said components (Zdl , £d2, £d3, . . . SdN) representing a direction. 
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each of the said components (Sdl , 2d2, Zd3, . . . SdN) being the sum of the said M 
sub-signals (si, s2, s3, sM) corresponding components (dl, d2, d3, ... dN), 

5 

According to the invention, the format may advantageously be applied as an inter- 
mediate signal processing format and e,g. different sound sources represented ac- 
cording to the same format may be superpositioned by a simple adding of the signal 
components. 

10 

Moreover, the invention relates to a method of processing audio signals according to 
claim 13, 

said signals comprising 

15 

M sub-signals (si, s2, s3, ... sM), 

each of the said sub-signals (si, s2, s3, ... sM) comprising 
2 0 N components (dl, d2, d3, . . . dN), 

each of the said components (dl, d2, d3, . . . dN) representing a direction; 

where said sub-signals (si, s2, s3, .. sM) are results of a room-simulation using 

2 5 room-simulators, preferably multi-directional room-simulators, 

where the said sub-signals (si, s2, s3, ... sM) are added to form a sum-signal (Ss) 
comprising 

3 0 N components (Sdl, Zd2, Zd3, . . . ZdN), 



each of the said components (Sdl, Zd2, £d3, ... ZdN) representing a direction. 
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each of the said components (Sdl, 2d2, Zd3, . . , SdN) being the sum of the said M 
sub-signals (si, s2, s3, ... sM) corresponding components (dl, d2, d3, ... dN). 

5 Moreover, the invention relates to a method of representing an audio signal (AS), 
said method according to claim 14, comprising the step of 

establishing at least two directional signal components (M), said directional signal 
components (M) preferably being uncorrelated. 

10 

The audio signal may subsequently be decoded into a desired rendering format, such 
as five channel. 

15 Basically, the audio format represents a flexible audio coding in the sense, that the 
audio-signals may be encoded without any knowledge about the rendering system. 

It should be noted that the method according to the invention implies a very advanta* 
geous representation of a signal due to the fact that the final rendering of the signal 

2 0 may be performed strictly according to a simple mapping if so desired, e.g. a gain 
matrix, still maintaining the intended total signal image. The final rendering may 
thus, if desired, be performed solely with focus on the rendering of a signal compo- 
nent having a certain direction parameter and without differentiating between the 
different signals and the type of the signals contained in the directional signal com- 

2 5 ponent. In other words, direct sound and subsequent room simulated signal may be 
rendered in the same way. 



30 



Accordingly, a music mix may be established once and for all in the format accord- 
ing to the invention, and subsequently, when new formats appear on the commercial 
market, a rendering may be established on the basis of the said format. 
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According to the invention, the term imcorrelated signal components implies that 
different directional signal components are independent in a degree that a subsequent 
rendering in a rendering system is possible without compensating for mutual depend- 
encies between different directional signal components. 

5 

According to the invention, the signal components may be subsequently be rendered 
in almost every possible audio rendering system, e.g. stereo, five channel surround 
sound, without considering the originally intended rendering method. 

10 A component includes an accumulation of zero or more signals in a direction defined 
by said component. 

Typically, the method according to the invention may be performed without any (or 
very little) knowledge of the downstream rendering system. 

15 

When, as stated in claim 15, the said audio signal is a room processed signal, a fur- 
ther advantageous embodiment of the invention has been obtained. 

Again, the term uncorrelated signal components implies that the signal components 
2 0 are independent. An independency between the signals of different directional signal 
components of a room simulated signal may specifically imply that signals repre- 
senting a direct soimd signal of a room signal should be independent with other di- 
rect soimd representing signals of other directional signal components. Evidently, 
when dealing with a room simulated signal, a correlation between signals of difFer- 
25 ent signal components will occur within the scope of the invention, due to the fact 
that a room simulation of a direct sound signal will likely produce a subsequent re- 
verberation sound signal bundle distributed in different signal directional compo- 
nents (generally). 

30 It should be noted that the method according to the invention implies a very advanta- 
geous representation of a signal when dealing with room processed signals due to the 
fact that the final rendering of the room simulated signal may be performed strictly 
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according to a simple mapping if so desired, e.g. a gain matrix, still maintaining the 
intended total signal image. The final rendering may thus, if desired, be performed 
solely with focus on the rendering of a signal component having a certain direction 
parameter and without differentiating between the different signals and the type of 
5 the signals contained in the directional signal component. In other words, direct 
sound and subsequent room simulated signal may be rendered in the same way. 

This unique feature implies e.g. that a music recording may be mixed, room proc- 
essed, etc. by a sound engineer and encoded/stored according to the invention. 

10 

The recording may then be transformed into a signal standard fitting to a certain de- 
sired rendering method, e.g. stereo. 

Subsequently, the stored music recording may be redistributed in another format 
1 5 without re-mixing the music recording. 

When, as stated in claim 16, at least two audio signal (AS) the signals are combined 
into one signal by means of an adding, a further advantageous embodiment of the 
invention has been obtained. 

2 0. 

According to the invention, an adding of signal components would preferably in- 
volve the signals (in time) of the same signal directional component of two different 
audio signals are summed. 

2 5 Clearly, according to the invention, the format and the method facilitate a modular 

approach according to which e.g. the room processing may be processed independ- 
ently. 

Moreover, the invention relates to a method of decoding a number (M) of directional 

3 0 components into a, preferably lower, number (N) of directional components accord- 

ing to claim 17, said method comprising the step a transforming of the input direc- 
tional components to a nimiber N of output directional components. 
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said input directional components representing a room simulated audio signal, said 
directional components being preferably uncorrelated. 

The method of decoding the above signal, which may also be regarded as a render- 
5 ing, has proved to be very efficient and effective, due to the fact that the rendering 
itself may be performed by means of simple signal processing if desired. Hence, a 
rendering of the directional components into an X-channel signal (X: more or less 
arbitrary) may be performed by means of a simple gain matrix mapping the M direc- 
tional components into the X number of channels. 

10 

Moreover, the invention relates to a rendering system comprising at least one input 
for receiving a number (M) of directional components (DC) according to claim 18, 
said rendering system comprising means for transforming the input directional com- 
ponents into a number (N) of output channels according to at least one rendering 
1 5 method stored in associated storing means (MSM). 

According to a preferred embodiment of the invention, the rendering system should 
be able to be re-configured, e.g. by means of software. 

2 0 According to a preferred embodiment of the invention M should be significantly 
higher than N. 

The rendering system according to the invention represents a very flexible down- 
stream rendering in the sense that the signal may be completely pre-established pre- 
2 5 vious to the rendering, thereby reducing the task of the rendering system to be a 
mapping. 
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Another advantageous feature is that the rendering system may be specifically de- 
signed to fit to the room (and e.g. amplifier and loudspeaker) in which the sound has 
to be reproduced. 
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According to a preferred embodiment of the invention, a unique spatial relation ex- 
ists between each input signal component and the intended location of an output 
channel of a rendering system. 

5 According to the invention, the means for transforming the input directional compo- 
nents into a number (N) of output channels may perform a signal processing based 
on a simple gain matrix mapping of the signal components into the said output sig- 
nals. Evidently, the means for transforming the input directional components into a 
number (N) of output channels may also additionally apply other signal processing 
1 0 methods, such as delay compensation for compensating the positioning of the loud- 
speakers of the rendering system, a compression applied for adapting the rendered 
output signal to noisy environments, e.g. a cabin of a car. 

When, as stated in claim 19, at least one of the said methods implies a transforming 
15 according to a gain matrix, a further advantageous embodiment of the invention has 
been obtained. 

According to the present embodiment of the invention, the required signal processing 
to be performed in the rendering system may be minimized. 

20 

When, as stated in claim 20, the said method stored in the said storing means may be 
exchanged by means of a suitable software transmitting and/or receiving interface, a 
further advantageous embodiment of the invention has been obtained. 

2 5 According to the present embodiment, the rendering methods may be downloaded 
into the rendering system, e.g. by means of an ERD A port, a traditional RJ 45 signal 
interface, etc. Another way may also be that of incorporating the method defining 
software on a DVD or a CD. 



30 



It should be noted that the bandwidth required for the transmittal of method defining 
software into the rendering may typically be relatively small. 
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When, as stated in claim 21, the rendering system comprises a user interface adapted 
for selecting at least two different predefined rendering methods stored in said stor- 
ing means, a further advantageous embodiment of the invention has been obtained. 

5 According to the invention, a user may adapt his rendering system, to the room in 
which music (or speech) is to be reproduced by simply switching between different 
predefined methods. Evidently, the rendering system should comprise the amplifier 
and associated loudspeakers of the rendering system. 

10 The rendering system may e.g. comprise a set of predefined rendering methods asso- 
ciated with stereo reproduction, a set of predefined rendering methods associated 
with five channel rendering, etc. 

When, as stated in claim 22, said system comprising a set of output channel (OC) 
15 connectors of which the rendering method may define a subset of output channel 
connectors to be activated when applying the transforming defined by the said, a 
further advantageous embodiment of the invention has been obtained. 

According to the invention, a rendering system may feed a traditional amplifier/loud 
2 0 speaker setup (e.g. a stereo setup) by means of two rendering method defined output 
channel connectors if so desired. Moreover, the same system may moreover feed 
another type of rendering system via other method defined output channel connec- 
tors. 

2 5 Accordingly, the rendering system according to the invention may comprises e.g. a 

DVD reading unit and a unit adapted for allocating the directional signal read by the 
DVD into a kind of crossfield output connectors. 

Moreover, the invention relates to a multi-channel data carrier according to claim 23, 

3 0 said data carrier comprising a number (M) of audio channels (M), at least two of said 

channels representing a directional signal with respect to a virtual listener/reference 
position (VLP). 
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According to the above stated embodiment of the invention, an advantageous ren- 
dering of a multi-channel directional audio signal stored at the carrier may be ob- 
tained. Experiments have show^n that subsequent rendering of the signal into e.g. a 
5 multi-channel rendering system, not necessarily predefined at the time of storing the 
audio-signal at the data carrier, may facilitate a relatively simple, and even more im- 
portant high quality rendering. Experiments have shown, that the signal format e.g. 
facilitate a convincing rendering of the preprocessed room simulation. 

10 A further advantageous feature of the invention is that the bandwidth may be kept 
constant irrespective of the character of the stored audio signal, the number of 
sources, etc. 

According to a preferred embodiment of the invention the audio-signal stored at the 
15 data carrier should be almost completely preprocessed with respect to the mixing and 
sound-engineering. 

It should be noted that it is known to present music in different formats at the same 
data carrier, e.g. stereo, five channel audio, Dolby Surround ,etc.. Each format is 
2 0 typically mixed separately, and anyway, each format may hardly be transformed into 
another rendering system. 

But, according to the invention, a music mix may be released once and for all e.g. on 
a DVD, and if another rendering format is desired later, the rendering system may be 
2 5 re-configured without any need for manipulating the origin data source. 

When, as stated in claim 24, the directional channels are established independently of 
a subsequent rendering, a further advantageous embodiment of the invention has 
been obtained 

30 

The directional components are established independently of the subsequent render- 
ing of the audio channels. 
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When, as stated in claim 25, the number of directional charmels are at least eight, 
preferably at least twenty, a further advantageous embodiment of the invention has 
been obtained. 

5 

When, as stated in claim 26, at least two of the directional channels are uncorrelated, 
a further advantageous embodiment of the invention has been obtained. 

When, as stated in claim 27, at least two of the directional channel are stored at the 
1 0 data carrier in a compressed state, a further advantageous embodiment of the inven- 
tion has been obtained. 

If desired, the effective necessary bandwidth of the complete signal represented in 
the format according to the invention may be reduced taken into consideration that 
15 many audio direction components are typically un-occupied in periods. Such un- 
occupied channel components will typically increase when increasing the number of 
channels. 

A possible compression may e.g. be applied according to PCT applicaiton WO 
2 0 97/38493 by Philips. 
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The figures 



The invention will be described below with reference to the drawings of which 

5 fig. 1 shows the basic understanding of a reverberated sound 

fig.2 shows the basic principles of a sound processing device according to 

the invention 

fig.3a-3c shows different sub-portions of the system according to the invention, 
fig. 4a-4b illustrates early pattern generators according to the invention 
1 0 fig. 5 shows a two-dimensional system of co-ordinates for illustrative pur- 

poses, 

fig, 6a-6b shows two embodiments of an audio signal format according to the 
invention, 

fig. 7 shows a three dimensional system of co-ordinates with two additional 

15 planes, 

fig. 8 shows the angle of the elevation of the additional planes shown in fig. 

7, 

fig. 9 shows another embodiment of an audio signal format according to the 

invention, 

2 0 fig. lOa-1 Oc shows fig. 9 divided into sub-planes for the readers convenience, 

fig. 1 1 shows an audio signal processing method according to the invention, 

fig. 12 shows the same as fig 11, but illustrated in an alternative manner, 

fig. 13 shows a processing system according to a fiirther embodiment of the 

invention, 

2 5 fig. 14 shows a recording, distribution and/or reproducing system according to 

a still fiirther aspect of the invention, 
fig. 1 5 shows a recording, distribution and/or reproducing system as illustrated 

in fig. 14, but showing a storing medium for the pre-processed signal 
and the generation of the input signals, 

3 0 fig. 16 shows a additional embodiment of the system illustrated in fig. 14, and 

fig. 17 shows different modules, of a rendering system according to the inven- 
tion. 
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Detailed description 

According to most embodiments of the invention, it is the general approach that arti- 
5 ficial generation of room simulated sound should comprise an early reflection pattern 
and a late sound sequence, i.e. a tail sound signal. 

It should be noted that the invention is basically directed at the early reflection pat- 
terns, and consequently sound processing based on early reflections patterns within 
1 0 the scope of the invention. 

Fig. 1 illustrates the basic principles of a conventional signal processing unit. 

The circuit comprises an input 1 communicating with an initial pattern generator 2 
1 5 and a subsequent reverberation generator 3. In addition, the initial pattern generator 2 
and the subsequent reverberation generator 3 are connected to two mixers 4, 5 having 
output channels 6 and 7, respectively. 

The initial pattern generator 2 generates an initial sound sequence with relatively 

2 0 few signal reflections characterising the first part of the desired emulated sound. It is 

a basic assumption that the initial pattern is very important as a listener establishes a 
subjective understanding of the simulated room on the basis of even a short initial 
pattern. 

25 An explanation of this performance is that this signal reception corresponds to the 
actual sound propagation and reflection in a real life room. 

Hence, reflections in a certain room will initially comprise relatively few reflections, 
as the first sound reflection, also called first order reflections, have to propagate fi-om 

3 0 a sound source at a given position in the room to the listener's position via the near- 

est reflecting walls or surfaces. Compared with the overall heavy complexity of the 



wo 01/11602 



PCT/DKOO/00443 



17 

technique, this sound field will be relatively simple and may therefore be emulated in 
dependency of the room and the position of the source and the listener. 

Subsequently, and of course with some degree of overlapping, the next reflections 
5 will appear at the listeners position. These reflections, also called second order re- 
flections, will be the sound waves transmitted to the position of the receiver via two 
reflecting surfaces. 

Gradually, this sound propagation will increase in dependency of the room type, and 
10 finally the last reflected sound will be of a more diffuse nature as it comprises several 
reflections of several different orders at different times. 

Apparently, the sound propagation will gradually result in a diffuse sound field and 
the sound field will more or less become a "sound soup". This diffuse sound field 
15 will be referred to as the tail sound. 

If the walls have high absorption coefficients, the propagation will decrease quite fast 
after a short time period of time while the sound propagation will continue over a 
relatively long period of time if the absorption coefficients are low. 

20 

Fig. 2 illustrates the basic principles of a preferred embodiment of the invention. 

For reasons of explaining, the shown embodiment of the invention has been divided 
into three modules 20A, 20B and 20C. 

25 

The first module 20A of the room simulator, according the embodiment shown, 
comprises M source inputs 21, 22, 23. 



30 



The source inputs 21, 22 and 23 are each connected to an early pattern generator 26, 
27 and 28. 
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Each early pattern generator 26, 27 and 28 outputs M directional signals to a sum- 
ming unit 29. The summing unit adds the signal components of each of the N prede- 
termined directions from each of the early pattern generators 26, 27 and 27. 

5 The summing unit output N directional signals to the module 20B comprising direc- 
tion rendering unit 201 . 

The basic establishing of the N directional signals has been illustrated in fig. 3a. 

1 0 Now returning to fig. 2, the direction rendering unit converts the N directional signal 
to a P channel signal representation. 

The basic establishing of the P channels of module 20B has been illustrated in fig. 
3b. 

15 

Moreover, the system comprises a third module 20C. The module 20C comprises a 
reverb feed matrix 202 fed by the M source inputs 21, 22, 23. The reverb feed matrix 
202 outputs P channel signals to a reverberator 203 which, in turn, outputs a P chan- 
nel signal to a summing unit 204. 

20 

Thus, the summing unit 204 adds the P channel output of the reverberator 203 to the 
output of the direction rendering unit 201 and feeds the P channel signal to an output. 

The basic establishing of the P channels of module 20C has been illustrated in fig. 
2 5 3c. 

Before explaining the overall functioning of the algorithm, the basic functioning of 
the early pattern generators 26, 27, 28 and the summing unit 29 will be explained 
with reference to fig. 3a 



According to fig. 3a, the module 20A comprises a number of inputs SI, S2, S3 and 
S4. 
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It should be noted that a number of four inputs have been chosen for the purpose of 
obtaining a relatively simple explanation of the basic principles of the invention. 
Many other input numbers may be applicable. 

5 

Each of the inputs are directed to an early pattern generator 26, 27 and 28, Each early 
pattern generator generates a processed signal specifically established and chosen for 
the source input SI, S2, S3 and S4. The processed signals, according to the shown 
embodiment, are established as a signal composed of seven signal components dl, 
10 d2, d3, d4, d5, d6 and d7. The seven signal components represent a directional signal 
representation of the established soimd and the established signal contains both the 
direct sound and the initial reverberation sound. 

A possible embodiment of the invention impHes a five channel rendering of 10- 
15 directional signal where the directions of the input signal format are 0, +A15, +7-30, 
-I-/-70, +A110 and 180 degrees, and the intended location of the five corresponding 
loudspeakers are 0, +/-30 and +A1 10 degrees according to ITU 775. 

Obviously, several other directions and locations may be applicable. A preferred em- 
2 0 bodiment comprises more than 20 directions. 

Accordingly, each of the inputs SI, S2, S3 and S4 may refer to mutually different 
locations of the input source to which the early pattern is generated. 

2 5 Successively, the signals fi^om each source are summed in summing unit 29. The 

summing is carried out as a simple adding of each signal component, i.e. 
dl:=dl(Sl)+ dl(S2)+ dl(S3)+ dl(S4), d2:=d2(Sl)+ d2(S2)+ d2(S3)+ d2(S4), 
d3:=d3(Sl)+ d3(S2)-f d3(S3)+ d3(S4), d4:=d4(Sl)+ d4(S2)+ d4(S3)+ d4(S4), 
d5:=d5(Sl)+ d5(S2)+ d5(S3)+ d5(S4), d6:=d6(Sl)+ d6(S2)+ d6(S3)+ d6(S4) and 

3 0 d7:=d7(S 1 d7(S2)+ d7(S3H d7(S4). 
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It should be noted that, even though undesired, according to the preferred embodi- 
ment of the invention, the signals dl,..,d7 may comprise tail sound components or 
even whole tail-sound. It should nevertheless be emphasised that according to the 
preferred embodiment of the invention such tail sound may advantageously be gen- 
erated according to a relatively simple panning algorithm and subsequently added to 
the established summed initial sound signal as the established summed initial sound 
comprises the dominating room determining effects. 



Moreover, it should be emphasised that a separate tuning of the resulting tail-sound 
1 0 signal is much easier when made separately from the individual tuning of the differ- 
ent source generators. 

Turning now to module 20B, fig. 3b illustrates the basic functioning of the direction 
rendering unit 201. 

15 

According to the shown embodiment of the invention, the seven directional signal 
outputs from module 20A are mapped into a chosen multi-channel representation. 
According to the illustrated embodiment, the seven directional signals are mapped to 
a P=5 chaimel output. 

20 

According to a preferred embodiment of the invention, the type of multi-channel 
representation is a selectable parameter, both with respect to number of applied 
channels and to the type of speaker setup and the individual speaker characteristics. 

2 5 The conversion into a given desired P channel representation may be effected in sev- 

eral different ways such as implying HRTF based (head related transfer function), a 
technique mentioned as Ambisonics, VBAP (vector based amplitude panning) or a 
pure experience based subjective mapping. 

3 0 Turning now to fig. 3c module 20C is illustrated as having an input from each of the 

source inputs SI, S2, S3 and S4. The signals are fed to a reverb feed matrix 202 
having five outputs, corresponding to the chosen channel number of the direction 
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rendering unit 201 . The five channel outputs are fed to a reverberation unit 203 pro- 
viding a five channel output of subsequent reverberation signals. 

The reverb feed matrix 202 comprises relatively simple signal pre-processing means 
5 (not shown) setting the gain, delay and phase of each input's contribution to each 
reverb signal and may also comprise filtering pre-processing means. 

Subsequently, the reverberation unit 203 establishes the desired diffuse tail sound 
signal by means of five tank circuits (not shown) and outputs the resulting sound 

10 signal to be added to the already established space processed initial sound signal. 
According to the illustrated preferred embodiment of the invention, the tail sound 
generating means are added using almost no space processing due to the fact that a 
space processing of the tail sound signal according to the diffuse nature of the signal 
has little or no effect at all. Consequently, the complexity of the overall algorithm 

1 5 may be reduced when adding the tail sound separately and making the tuning much 
easier. 

Moreover, it should be noted that the above mentioned separate generation of the 
tail-sound provides a more natural diffuse tail-sound due to the fact that the distinct 
2 0 comb-filter effect of the early pattem generator should preferably only be applied to 
the initial pattem in order to provide naturalness. 

It should be noted that the above generation of subsequent reverberation signals, ac- 
cording to the present preferred embodiment, is generated independently of the initial 
2 5 sound generation. Nevertheless, it should be emphasised that the invention is in no 
way restricted to a narrow interpretation of the basic generation of a reverberation 
sound. Thus, within the scope of the invention, both the initial sound and the sound 
tail of each sound may of course be located within an artificial room and subse- 
quently summed in a summing unit. 

30 

Turning now to fig. 4a, an early pattem generator, such as 26 of fig. 2, is illustrated 
in detail. The early pattem generator is one of four according to the above described 
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illustrative embodiment of fig. 2, and each generator comprises a dedicated source 
input SI, S2, S3 and S4. 

The shown early pattern generator 26 comprises a source input SI. 

According to the shown embodiment, the source input is connected to a matrix of 
signal processing means. The shown matrix basically comprises three rows of signal 
processing lines, which are processed by shared diffusers 41, 42. 

Accordingly, the upper row is fed directly from the input SI, the second row is fed 
through the diffuser 41 , and the third row is fed through both diffusers 41 and 42, 

Each row of the signal processing circuit comprises colour filters 411, 412, 413; 421, 
422, 423; 431, 432, 433. According to the shown embodiment, colour filters of the 
same columns are identical, i.e. colour filter 41 1=421=431. 

It should nevertheless be emphasised that the colour filters may of course differ 
within the scope of the invention. 

Moreover each row comprises delay lines 4111,4121 and 4131 which are serially 
connected to the colour filters 411,412, 413. Finally, each column may be tapped via 
level and phase controllers such as 4000, 4001 and 4002. It should be noted that each 
level-phase controller 4000, 4001 and 4002 are tap specific. 

Hence, the initial pattern generator 26 comprises a matrix which may comprise sev- 
eral sets of predefined presets by which a certain desired room may be emulated. 

As already mentioned and according to the simplified embodiment of the invention, 
signals of the current predefined room emulation are tapped to the directional signal 
representation of the present sound source SI. According to the illustrated program- 
ming, four signal lines are tapped to seven directional signal components. One signal. 
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N13 of row 1, column 3, is fed to sound component 1, one signal, N21, is fed to sig- 
nal component 3, and two signals, Nl 1 and N22 are added to the sound component 4. 

It should be noted that each tapped signal has consequently been processed through 
one of three combinations of diffusers, one of three types of predefined colour filters 
EQ, a ft^eely chosen length of delay line and a fi-eely chosen level and phase output. 

Obviously, several other combinations and number processing elements are applica- 
ble within the scope of the invention. 



According to one of the preferred embodiment of the invention, a separate row with a 
level-phase controller 4002 should be tapped and determine the direct sound. When 
integrating the direct sound into the early pattern generation, the location of both the 
direct sound as well as the corresponding EPG and reverberation sound signals may 
15 be mapped into the sound signal representation completely similar to the desired di- 
rectionality irrespective of directional resolution and complexity. 

Evidently, the directional signal representation components usually comprise signals 
fed to each component 1 -7 and not only the illustrated three. 



It should be noted, that the chosen topology of the early pattern generator within the 
scope of the invention may be chosen fi-om a set of more or less equivalent topolo- 
gies. Moreover, the signal modifying components may be varied, if e.g. a certain 
degree of tail-sound is added before or after tapping. 

As the illustrated early pattern generator comprises linear systems, it will be possible 
to interchange the components, e.g. the colour filters EQ may be interchanged with 
the diffusers DIF. 



3 0 Fig. 4b illustrates a further possible embodiment of the early pattern generator, com- 
prising colour filters EQ placed in the feed line to each row and diffusers DIF placed 
in each column in each row. 
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Likewise, the number of columns and rows may vary depending of the system re- 
quirements. In a possible embodiment only one column of delay lines with corre- 
sponding colour filters or diffusers is utilised. Moreover, additional components, 
5 additional diffusers, additional different types of colour filters, etc. may be chosen. 

Finally it should be mentioned that, according to a preferred embodiment of the in- 
vention, the number of directions, i.e. signal components, should be not less than 
twelve, and the established reflections of each early pattern generator should not be 
10 less than 25. 

The basic presetting of each early pattern generator may initially be determined by 
known commercially available ray tracing or room mirroring tool, such as ODEON. 

15 To describe the invention, it will be necessary for illustrative and explanatory pur- 
poses first to define a plane (and later a room) wherein the elements of the audio 
format according to the invention may be placed. 

Fig. 5 illustrates a two-dimensional system of co-ordinates, with the axes labeled 'x' 
20 910 and 'y' 91 1 . The axes 910, 91 1 are perpendicular to each other, and both are 
parallel to the groimd, i.e. they are arranged in a horizontal plane. At the systems 
origin a head of a person 912 is placed, with the nose pointing in the same direction 
as the X-axis 910. A circle 913 with radius of a unit and center at the systems origin 
is drawn in the plane of the x- and the y-axes. 

25 

Fig. 6a illustrates an audio signal format. It shows again the two dimensional system 
of co-ordinates of figure 5. It comprises the axes x 910 and y 91 1, and the unit circle 
913. The intersection 912 of the two axes 910, 91 1 represents the position of the head 
912 of figure 5. 

30 

Further figure 6 comprises twelve vectors 920, all beginning at the systems origin 
and pointing towards the unit circle 913, all having the length of one unit. The twelve 
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vectors 920 (dl . . . dl2) are evenly distributed around the circle 913, causing the an- 
gle 921 between two neighbor vectors to be the same, indifferently to which vector is 
chosen. 

5 Incoming sounds may be defined by these vectors, the direction of the vectors repre- 
senting the direction of the incoming sounds (or rather the direction, fi'om which the 
incomings sound signals are coming), and a number representing the magnitude or 
amplitude of the incoming sounds signals. 

1 0 The number of vectors 920 (twelve) is only an example. It is possible to comprise 
any number of vectors, as long as the number of vectors is sufficient to define the 
incoming sounds satisfactorily. A preferred embodiment would comprise more than 
25 vectors or directions, for example 30, 40, 50 or even more. The higher the number 
of vectors, the higher resolution of direction is achieved. And the higher resolution of 

15 direction, the more accurate source localization is achieved. Itor soimds coming from 
sources placed in front of the head, the human beings are capable of distinguishing 
directional differences, as small as 3 degrees. This is the so-called localization blur. 

The illustrated angle 921 between two neighbor vectors is only an example. It is pos- 
2 0 sible to comprise any principle of vector distribution around the circle, including 
uniform distribution and experience based, e.g. psycho-acoustic based distribution. 

Having in mind that the ears are not quite as good of directional distinguishing of 
sounds coming from behind the head or from the sides of the head, as they are of 
2 5 sounds coming from in front of the head, it is advantageous to comprise a distribu- 
tion with fewer vectors behind the head than in front of the head. This gives a less 
accurate localization behind the head, but the human being will normally not be able 
to tell the difference anyway. A preferred embodiment using this distribution princi- 
ple is shown in fig. 6b. 

30 

Another distribution of the vectors 920 could be based on measures of the density of 
different sounds in all possible directions. The vectors could then be distributed with 
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small angles between them in direction sectors with high density of directional sound 
signals and with larger angles between them in direction sectors with low density of 
directional sound signals. 

A further way of distributing the vectors around the circle could be based on human 
impressions. 

Another perspective of the invention is added when the above format is defined ac- 
cording to a room instead of just a plane. Fig. 7 defines such a room. It is a three- 
dimensional system of co-ordinates, with the axes labeled *x' 930, 'y' 931 and *z' 
932. The axes 930-932 are perpendicular to each other, that is: each axis is perpen- 
dicular to the other two. At the systems origin a head of a person 912 is placed, with 
the nose pointing in the same direction as the x-axis 930. Further two circles 946a 
and 946b have been added. These circles 946a and 946b are placed in parallel with 
the unit circle 933, but are displaced along the z-axis in such a way that they still 
have their centers at the z-axis. Furthermore the distance from the systems origin 912 
to any point at the two circles 946a and 946b are exactly one unit, as the circles are 
placed on a sphere with its center in the systems origin. The circle 933 lying in the x- 
y-plane is called the middle plane circle. The circle 946a displaced along the z-axis 
in the positive direction is called the upper circle. The circle 946b displaced along the 
z-axis in the negative direction is called the lower circle. 

Fig. 8 corresponds to fig. 7. It comprises the three axes 930-932 of the three- 
dimensional co-ordinate system. Further it comprises the imit circle 933 of the x-y- 
plane and the two additional circles 946a, the upper circle, and 946b, the lower cir- 
cle, placed in parallel with the x-y-plane. The circles 946a and 946b are centered at 
the z-axis, and the distance jfrom any point on these circles to the systems origin 912 
is exactly one unit as described above. Further fig. 8 comprises two angles 95 la and 
95 lb. The angle 95 la is the angle between the x-axis and the direction from the sys- 
tems origin 912 to a point at the circle 946a lying exactly above the x-axis. The angle 
951b is defined the same way, yet according to the circle 946b. This way the angles 
951a, 951b indicates the displacement of the circles 946a, 946b. 
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Fig. 9 shows an embodiment of an audio fomiat comprising three dimensions. It 
comprises the elements of figure 8. Further it comprises a number of vectors 920, 
pointing from the systems origin 912 to the middle circle 933. These vectors 920 are 
comparative to the vectors of the two dimensional audio format of figure 8a. 

Further fig. 9 comprises a number of vectors 960a pointing from the systems origin 
912 to the upper circle 946a and a number of vectors 960b pointing from the systems 
origin 912 to the lower circle 946b. 

To be able to distinguish the vectors of these three vector systems, the three circles 
are drawn separately in fig. 10a- 10c. 

Fig. 10a shows the upper circle 946a and its corresponding vectors 960a of the three 
dimensional directional audio format. The angle 951a indicates the displacement of 
the upper circle from the x-y-plane. In addition to these already mentioned elements, 
fig. 10a comprises an angle 971a. It indicates the rotation of a vector 960a from the 
direction of the x-axis 930, with axis of rotation at the z-axis 932. 

Fig. 10b shows the middle circle 933 and its corresponding vectors 920 of the three 
dimensional directional audio format. The angle 921 indicates the angular distance 
between two vectors 920. 

Fig. 10c shows the lower circle 946b and its corresponding vectors 960b of the three 
dimensional directional audio format. The angle 951b indicates the displacement of 
the lower circle from the x-y-plane. The angle 971b indicates the rotation of a vector 
960b from the direction of the x-axis 930. 

Drawn in the same co-ordinate system, fig. 10a- 10c will end up as fig. 9. 
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With reasons as those for the two dimensional audio format, the number of vectors 
corresponding to each circle and the number of circles are only examples, and any 
number of vectors and circles are within the scope of the invention. 

5 As a human being are more capable of directional distinguishing of sources in a hori- 
zontal plane than in a vertical plane, it is not necessary to have the same resolution of 
vectors up- and downwards as it is sidewards. This makes a preferred number of 
horizontal circles 5. These comprise from the top a second upper circle, a first upper 
circle, a middle circle, a first lower circle and a second lower circle. Common to all 
1 0 circes imaginable is that the distance from any point at a circle to the system origin, 
i.e. the head of a human being, is one unit, or in other words, the circles are all placed 
with their perimeters on the surface of a sphere with a radius of one imit. 

A preferred embodiment would also comprise fewer vectors pointing towards each of 
1 5 the upper or lower planes than to the middle plane. This is because the highest reso- 
lution of vectors wanted and also usable to the human being is in the middle plane. 

In a further preferred embodiment an upper circle and/or a lower circle is situated 
near respectively the positive part (i.e. the angle 951a being close to 90"") and the 
2 0 negative part (i.e. the angle 95 lb being close to 90"*) of the z-axis and contains only 
few vectors. Such upper and/or lower circles could even be defined as points on the 
z-axis, whereby only one vector would correspond to these upper and/or lower "cir- 
cles", that is a vector located along the positive direction of the z-axis and/or a vector 
located along the negative direction of the z-axis. 

25 

In an advantageous simple embodiment of the invention the audio format could be 
defined by a middle circlee as described above in combination with a vector along 
the positive part of the z-axis and optionally a vector along the negative part of the z- 
axis. 

30 

The distribution of vectors around a circle is also just an example. As explained for 
the two-dimensional audio format many distribution principles are imaginable and 
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applicable and hence within the scope of this invention. This could be a uniform dis- 
tribution principle as shown in the drawings 9, 10a- 10c, an experience based distri- 
bution principle, a distribution principle based on measures of the localization blur in 
different directions as shown for the two-dimensional system in fig. 6b or a distribu- 
5 tion principle based on measures of the portion of sound gradients in different areas 
for a specific room and situation. 

Further it shall be pointed out that the vectors could be placed in other manners than 
with their ends placed at a circle, especially the vectors, which are placed with an 
1 0 angle in relation to the x-y-plane. The angle 951a or 951 (fig. 8) in relation to the x- 
y-plane could vary as a function of the angle 921 (fig. 6a) if found appropriate, 
whereby the ends of the vectors could be situated on for example non-circular curves 
on the surface of the unit sphere. 

15 In fig. 1 1 and 12 is illustrated signal processing methods and systems, which utilize 
the above-described audio signal formatting system. 

A number of signals, preferably audio signals, si - sM are provided in the signal 
format according to the invention, e.g. comprising N directional components ac- 
2 0 cording to the same directional format. It shall be noted that not all of these compo- 
nents actually need to contain any signal, as the signal format must be expected to 
comprise a relatively large nxmiber of directional components in order to be able to 
represent the involved signal sources satisfactorily. Thus some (or even a large num- 
ber) of the components of the actual signal sources may be zero. 

25 

The audio signals si - sM may be recordings of or (microphone) signals stemming 
from single musical instruments, group of instruments, singers etc. or the signals si - 
sM may be other forms of signals, which will have to be combined to represent a 
resulting audio signal or other forms of signals. 

30 

The source signals si - sM are directed to a signal processing unit 972 or 982, said 
signal processing units serving to combine the source signals si - sM and to provide 
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an output signal 973 or 983, respectively, which also is represented in the audio sig- 
nal format according to the invention, e.g. comprising N directional components, said 
components having the same directions as the components used for representing the 
source signals si - sM. 



10 



15 



In a preferred embodiment the processing involved for combining the source signals 
is a sununing of the corresponding components of each source signal si - sM. The 
summing is carried out as a simple adding of each signal component, i.e. 

Zdl:=dl(sl)+ dl(s2)+ + dl(sM), 

i:d2:=d2(sl)-»- d2(s2)+ + d2(sM), 

Sd3:=d3(sl)+ d3(s2)+ + d3(sM), 

Zd4:=d4(sl)-h d4(s2)-f + d4(sM), 



SdN:=dN(sl)+ dN(s2)+ + dN(sM). 



2 0 Other forms of signal processing may be performed by the signal processing units 
972 and 982, even additional processing not primarily serving to combine the com- 
ponents of the source signals, but serving to amend, equalize, add reverberation to, 
etc. the resulting signal. But preferably such additional processing will be performed 
in a later stage or stages of the signal processing chain but before the final direction 

2 5 rendering unit (DRU) will perform the mapping of the signals to the available sound 

reproducing system, e.g. the loudspeaker system. 

The basic functioning of the direction rendering unit (DRU) will thus be to map the 
N directional signal outputs 973 and 983 from units 972 or 982 into a chosen multi- 

3 0 channel representation, according to the available speaker set-up. 
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The conversion into a given desired P channel representation may be effected in sev- 
eral different ways such as implying HRTF based (head related transfer fimction), a 
technique mentioned as Ambisonics, VBAP (vector based amplitude paiming) or a 
pure experience based subjective mapping. 

5 

Fig. 13 shows a system according to a further aspect of the multi-channel audio for- 
mat and processing method of this invention. It shows a model of a reverberation 
unit. Here a number of units 9101a-9101c, called room simulator units, calculate how 
the sound emitted from a source 9100a-9100c will be heard at the listener's position, 

1 0 including reflections from the room. These room simulator units may for example be 
early pattern generators, EPG's, which will be assumed to be the case in the follow- 
ing. As output each EPG 9101a-9101c has the same listener placed at the same loca- 
tion, but as input the EPG's could have different instruments 9100a-9100c placed at 
different locations. To make the best room-simulation, the EPGs should calculate the 

15 resulting sound from as many directions as possible, and this result 9102a-9102c 
could then be sent on in the audio format of this invention. The result of each EPG 
should in any suitable way be added to form the final sound heard at the listener's 
position, and this addition could be made according to the audio signal processing 
method 9103 of this invention. The result 9104 from passing the outputs from the 

2 0 EPGs through the processing method, would be in the multi-channel audio format of 
this invention. Finally the soxmd in each direction of this format would have to be 
mapped to the available loudspeakers 9106a-9106b or channels chosen by the user. 
This mapping is performed by a direction rendering unit (DRU) 9105. 

2 5 Instead of just representing the sound coming from a particular direction, each vector 
could represent the method of calculating the sound coming from that particular di- 
rection. In this embodiment each vector would comprise a listing of partial sound 
signals as a fimction of time and as a function of the actual input signals. In each 
vector shall then the partial soxmd signals be summed. 

30 

According to a further embodiment of the invention the vectors may represent further 
sound describing information, for example source direction, coloring etc. 
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Fig. 14 illustrates a recording, distribution and/or reproducing system according to a 
further aspect of the invention, said system utilizing the signal fomiat according to 
the first aspect of the invention. 

5 

A number of source signals si — sM, each comprising a number of directional com- 
ponents dl - dN, are led to a pre-processing unit (PPM) 9111. The input source sig- 
nals, which preferably may be audio signals, may each stem from a single musical 
instrument, a singer or another source of audio signals, or may stem from a group of 

1 0 instruments, a group of singers, a group of other audio sources or combinations of 
these. These signals may have been generated using commonly available methods 
and equipment, e.g. microphones, while simultaneously producing the directional 
components of the signals. Further the input soiu-ce signals si - sM, or some of 
these, may have been generated using audio generators and/or simulators, for exam- 

15 pie room simulators, early pattern generators (EPG's) etc. as indicated above. 

In the pre-processing unit (PPM) 9111a processing of the input source signals si - 
sM takes place. This pre-processing may in a simple form be a summing of the cor- 
responding directional components of each input source signal, e.g. 
20 dl:=dl(sl)+dl(s2)+dl(s3)+ ....+ dl(sM), 

d2:=d2(sl )+ d2(s2)+ d2(s3)+ .... +d2(sM), 



25 dN:=dN(sl)+ dN(s2)+ dN(s3)+ .... +dN(sM), 

whereby the output signal IF from the pre-processing unit (PPM) 9111 will be gener- 
ated. 

Other forms of signal processing may be incorporated in the pre-processing imit 
3 0 (PPM) 9111, such as equalizing, coloring, addition of tail sound signals, reverbera- 
tion, delaying etc. Also amendments of the format of the signals may take place, e.g. 
reduction of the number of directional components, canceling of certain directional 
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components, addition of certain components, which may contain infomiation relating 
to the recorded or processed signals. 

The resuhing output signal IF from the pre-processing unit (PPM) 9111 thus com- 
5 prises a number T of signal components, of which most or all are directional compo- 
nents. The number T may equal the number N of directional components in each of 
the input source signals si - sM, or T may be larger than or less than N. 

The output signal IF constitutes an intermediate signal format, which is suitable for 
10 storing, transmission or otherwise distribution of the recorded signals, preferably 

audio signals, while simultaneously retaining as much detailed information about the 
input signals as possible. The output signal IF may thus be stored on any suitable 
form of storing media such as CD*s, DVD's or static storing media of the electronic, 
magnetic, optical etc. variety as illustrated in fig. 15 by the storing means 91 15. 
1 5 Further the output signal IF may be transmitted in any suitable manner, for example 
by distribution via the Internet or other suitable means. 

The T-channel signal IF is received by user processing means (UPM) 9112, which 
may be incorporated in the reproducing system or apparatus at the end-user, for ex- 
2 0 ample when a storing medium such as a CD or a DVD is played on the apparatus of 
the end-user, or when a signal IF is received via electronic, electromagnetic or opti- 
cal communication means, for example via the Intemet, by the reproducing system at 
the end-user. 

2 5 Ordinarily the reproducing equipment at the end-user will not be enable of repro- 

ducing the relatively large number of directional components, i.e. channels, com- 
prised in the signal IF, but will be able to reproduce the signals, preferably the audio 
signals, in one or more of the commonly used bi- or multi-channel systems, e.g. ste- 
reo system, 3-, 4- or 5-channel systems, surround systems etc. or in a user-specified 

3 0 set-up. The user processing means (UPM) 9112 therefore comprises means, for ex- 

ample in the form of a decoder, for transforming the T-channel signal IF into a signal 
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containing a suitable number of channels, which may be reproduced by the end-user 
equipment. 

The user processing means UPM may be predestined to transform the received signal 
from a certain number of channels T to a certain number of channels to be repro- 
duced by the end-user equipment. Alternatively, the user processing means UPM 
may be configured to determine the number of channels T incorporated in the re- 
ceived signal IF and to perform a transformation from this number of channels to a 
certain number of channels to be reproduced by the end-user equipment. Further, the 
user processing means UPM may be able to perform a transformation to one of two 
or more different reproducing systems UFl - UFk, containing a different number of 
channels, in dependence upon an active choice by the user or in dependence upon 
other factors, such as parameters of the received signal IF. 

As illustrated, the user processing means UPM of fig. 14 is configured to reproduce 
the audio input signals in a specific reproducing system labeled UF3, which may be 
any type of the relatively large number of reproducing systems (labeled UFl - Ufk 
on fig 14), which are available at present, and which may reproduce the signals via 
loudspeaker systems using stereo systems, 3-, 4- or 5-channel systems and/or user- 
specified set-ups etc. 

In addition to the means for transforming the number of channels, the user process- 
ing means UPM may further comprise other means for processing the received signal 
IF, for example means for equalizing, other means for coloring the signals, delaying 
means, adding reverberation, dynamic processing etc. These further processing steps 
may be designated in relation to the actual type and character of the user reproducing 
equipment, e.g. in order to achieve an optimal sound reproduction. 

The transformation of the number of charmels T contained in the received signal IF 
to the number of channels, which may be reproduced by the actual end-user equip- 
ment, may be performed using linear transformation of the received signal- 
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components, for example by matrix-operation. More complicated operation may be 
performed to achieve more sophisticated and detailed results of the transfomiation. 

As indicated in fig. 15, the input signals si - sM may originate from microphones or 
similar transducers producing electric signals 9110a - 91 lOM. These signals are 
processed by signal processing means 91 14a - 91 14M, whereby the signals si - sM 
each having N signal components according to the signal format of the invention is 
produced. The signal processing means 9 11 4a - 91 14M may for example be room 
simulators, early pattern generators etc. Some or all of the input signals for these 
signal processing means or for the pre-processing means PPM 9111 may also have 
been produced artificially, for example by electronic music instruments or signal 
generators. 

A further advantageous embodiment of the invention will be described with refer- 
ence to fig. 16. This figure illustrates a system, which corresponds to the system de- 
scribed in connection with fig. 14, but where a further processing means, an interme- 
diate processing means (IPM) 91 13 has been added. This further processing means 
IPM provides an intermediate processing of the multi-channel signal IF having T 
channels (or directional components), whereby the number of channels may be re- 
duced to a number V. The output signal IF' having V channels (or directional com- 
ponents) may be stored on any suitable storing media 91 16 or may otherwise be 
transmitted or distributed to the end-users, represented by the end-user processing 
means UPM, where it may be reproduced as described in connection with fig. 14. 

The output signal IF firom the pre-processing unit (PPM) 9111 may also be stored on 
any suitable storing media (not shown in fig. 16) or may otherwise be transmitted or 
distributed. 

It will be understood that the intermediate processing may be performed by or for a 
record company, a record distributor, a distribution network etc. which has a need to 
distribute or may achieve an advantage by distributing signals with a fewer or a spe- 
cific number of channels. This may thus be performed by the IPM without influenc- 
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ing upon the other parts of the system, e.g. the system as shown in fig. 14 will func- 
tion without regard to the system shown in fig. 15. 

Fig. 1 7 shows different modules of a rendering system according to the invention. 

5 

The system comprises an independent pre-rendering stage PS. 

The pre-rendering stage PS comprises a data carrier reading unit DCRU. The unit 
may e.g. be a DVD player adapted for reading data stored in a DVD or it may e.g. 
1 0 comprise a solid state memory interface. 

The data carrier reading unit DCRU are connected to a direction rendering unit DRU 
by means of an M channel interface. 

1 5 The direction rendering unit DRU comprises method storing means MSM and a sig- 
nal processing unit SPU adapted for transforming the M-channel input into an N- 
channel output according to a rendering method stored in the said memory storing 
means MSM. 

2 0 The direction rendering unit may e.g. perform a relatively simple mapping of the 

input channels into the output channels by means of a gain matrix. The rendering 
method may e.g. be established by means of vector based amplitude panning. 

Moreover, the rendering unit may be adapted for exchanging the rendering method 
25 by means of software modifications. 

The N-channel output of the pre-rendering imit DCRU are then fed to output channel 
connectors (OC). 

3 0 The illustrated pre-rendering unit DCRU comprises only two signal output connec- 

tors. Hence, the pre-rendering unit DCRU is specifically adapted for transforming the 
input channels into a two-channel signal representation, such as stereo. 
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The outputs of the pre-rendering unit DCRU are then fed to a traditional stereo am- 
plifier RA connected with two loudspeakers LS. 

5 According to the invention, the pre-rendering stage may thus be adapted for fitting to 
an arbitrary combination of amplifiers and loudspeakers, e.g. a surround sound sys- 
tem. 

Evidently, another possible implementation of the invention would be a pre- 
1 0 rendering unit DCRU comprising a number of output connectors, e.g. ten. The pre- 
rendering unit DCRU may then, under control of the stored rendering method apply 
1 to 10 physical outputs. 

Evidently, the amplifier means may be incorporated in the pre-rendering unit DCRU 
15 within the scope of the invention. 

The direction rendering unit DRU may also comprise a set of rendering methods 
storing the methods storing means. The rendering methods may both be different 
with respect to basic properties determines by the number of output channels, but it 
2 0 may also be different with respect to the intended positioning of the loudspeaker of 
the rendering system. Accordingly, the illustrated stereo rendering system may com- 
prise e.g. twenty different predefined "stereo" variants with respect to e.g. the posi- 
tioning of the loudspeakers in the room or e.g. room characteristics. 

25 Such rendering may e.g. imply phase modification, equalizing, sound coloring, sound 
compression or the like. 
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CLAIMS 

1 . Audio signal format comprising 
N components (dl, d2, d3, ... dN), 

each of the said components (dl, d2, d3, . . . dN) representing a direction, said com- 
ponents preferably being uncorrelated. 

2. Audio signal format according to claim 1 comprising 
N components (dl, d2, d3, . . . dN), where N is at least 3. 

3. Audio signal format according to claim 1 or 2 comprising 

N components (dl, d2, d3, ... dN), where N is at least 10, preferably at least 20. 

4. Audio signal format according to claim 1-3, wherein the said directions are three- 
dimensional directions. 

5. Audio signal format according to claim 1-4, wherein some or all of said directions 
are angled in relation to a common reference plane and where preferably all of said 
directions to one and the same side of the plane have been placed with approximately 
the same angle in relation to the common reference plane. 

6. Audio signal format according to claim 1-5, wherein directions are placed on both 
sides of the common reference plane, where some or all of said directions are angled 
in relation to the common reference plane and where preferably all of said directions 
to one and the same side of the plane have been placed with approximately the same 
angle in relation to the common reference plane. 
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7. Audio signal format according to claim 1-6, wherein the angle of the directions on 
one side of the common reference plane and the angle of the directions on the other 
side of said plane are substantially equal. 

5 8. Audio signal format according to one or more of claims 1 - 7, wherein the said 
directions are distributed among all possible directions. 

9. Audio signal format according to claims 1-8, wherein the said directions are dis- 
tributed with a larger proportion of directions in areas with a relatively high density 

10 of sound signals than in areas with a relatively low proportion of sound signals. 

10. Audio signal format according to claim 1-8, wherein the said directions are dis- 
tributed with a larger proportion of directions in areas in which the human perception 
of sound signals is relatively sharp, e.g. in front of the head. 

15 

1 1 . Method of representing an audio signal, wherein said signal is decomposed to a 
signal comprising N directional components and according to an audio signal format 
as characterized in one or more of claims 1-10. 

2 0 12. Method of processing audio signals, 

said signals comprising 

M sub-signals (si, s2, s3, ... sM), 

25 

each of the said sub-signals (si, s2, s3, ... sM) comprising 
N components (dl, d2, d3, ... dN), 

3 0 each of the said components (dl , d2, d3, . . . dN) representing a direction; 
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where the said sub-signals (si, s2, s3, ... sM) are added to form a sum-signal (Ls) 
comprising 

N components (Zdl, Zd2, SdS, . . . EdN), 

5 

each of the said components (Edl, Zd2, 2d3, ... ZdN) representing a direction, 

each of the said components (Sdl, 2d2, 2d3, ... 2dN) being the sum of the said M 
sub-signals (si, s2, s3, ... sM) corresponding components (dl, d2, d3, ... dN). 

10 

13. Method of processing audio signals, 
said signals comprising 
15 M sub-signals (si , s2, s3, ... sM), 

each of the said sub-signals (si, s2, s3, ... sM) comprising 
N components (dl , d2, d3, ... dN), 

20 

each of the said components (dl, d2, d3, . . . dN) representing a direction; 

where said sub-signals (si, s2, s3, sM) are results of a room-simulation using 
room-simulators, preferably multi-directional room-simulators, 

25 

where the said sub-signals (si, s2, s3, ... sM) are added to fomi a sum-signal (Ss) 
comprising 

N components (Zdl, Ld2, Zd3, . . . SdN), 

30 

each of the said components (Zdl, Zd2, 2>i3, ... LdN) representing a direction, 
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each of the said components (Sdl, 2d2, Sd3, - 2dN) being the sum of the said M 
sub-signals (si, s2, s3, . . . sM) corresponding components (dl, d2, d3, ... dN). 

14. Method of representing an audio signal (AS), said method comprising the step of 

5 

establishing at least two directional signal components (M), said directional signal 
components (M) preferably being uncorrelated. 

15. Method of representing an audio signal according to claim 14, 
10 whereby the said audio signal is a room processed signal. 

16. Method of combining signals established according to the method according to 
claim 14 and 15, whereby at least two audio signals (AS) are combined into one sig- 
nal by means of an adding. 

15 

1 7. Method of decoding a number (M) of directional components into a, preferably 
lower, number (N) of directional components, said method comprising the step a 
transforming of the input directional components to a number N of output directional 
components, 

2 0 said input directional components representing a room simulated audio signal, said 
input directional components being preferably uncorrelated. 

18. Rendering system comprising at least one input for receiving a number (M) of 
directional components (DC), said rendering system comprising means for trans- 

2 5 forming the input directional components into a number (N) of output channels ac- 

cording to at least one rendering method stored in associated storing means (MSM). 

19. Rendering system according to claim 18, 

wherein one at least of the said methods implies a transfomiing according to a gain 

3 0 matrix. 



20. Rendering system according to claim 18 or 19, 
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wherein the said method stored in the said storing means may be exchanged by 
means of a suitable software transmitting and/or receiving interface. 

21 . Rendering system according to claims 18-20, 

5 wherein the rendering system comprises a user interface adapted for selecting at least 
two different predefined rendering methods stored in said storing means. 

22. Rendering system according to claims 18-21, wherein 

said system comprising a set of output channel (OC) connectors of which the ren- 
1 0 dering method may define a subset of output channel connectors to be activated 
when applying the transforming defined by the said. 

23. Multi-channel data carrier, said data carrier comprising a number (M) of audio 
channels (M), at least two of said channels representing a directional signal with re- 

15 spect to a virtual listener/reference position (VLP). 

24. Multi-channel data carrier according to claim 23, wherein the directional chan- 
nels are established independently of a subsequent rendering 

2 0 25. Multi-charmel data carrier according to claim 23 or 24, 

wherein the number of directional channels are at least eight, preferably at least 
twenty. 

26. Multi-channel data carrier according to claim 23-25, 

2 5 wherein at least two of the directional channels are uncorrelated. 

27. Multi-channel data carrier according to claim 23-26, 

wherein at least two of the directional channel are stored at the data carrier in a com- 
pressed state. 

30 
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